FAST-PCA: A Fast and Exact Algorithm for Distributed Principal Component Analysis

نویسندگان

چکیده

Principal Component Analysis (PCA) is a fundamental data preprocessing tool in the world of machine learning. While PCA often thought as dimensionality reduction method, purpose actually two-fold: dimension and uncorrelated feature Furthermore, enormity dimensions sample size modern day datasets have rendered centralized solutions unusable. In that vein, this paper reconsiders problem when samples are distributed across nodes an arbitrarily connected network. few for exist, those either overlook learning aspect PCA, tend to high communication overhead makes them inefficient and/or lack ‘exact’ or ‘global’ convergence guarantees. To overcome these aforementioned issues, proposes algorithm termed FAST-PCA (Fast exAct diSTributed PCA) . The proposed efficient terms proven converge linearly exactly principal components, leading well features. claims further supported by experimental results.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast, Exact Bootstrap Principal Component Analysis for p > 1 million.

Many have suggested a bootstrap procedure for estimating the sampling variability of principal component analysis (PCA) results. However, when the number of measurements per subject (p) is much larger than the number of subjects (n), calculating and storing the leading principal components from each bootstrap sample can be computationally infeasible. To address this, we outline methods for fast...

متن کامل

A Fast Incremental Multilinear Principal Component Analysis Algorithm

This study establishes the mathematical foundation for a fast incremental multilinear method which combines the traditional sequential Karhunen-Loeve (SKL) algorithm with the newly developed incremental modified fast Principal Component Analysis algorithm (IMFPCA). In accordance with the characteristics of the data structure, the proposed algorithm achieves both computational efficiency and hig...

متن کامل

Fast principal component analysis using fixed-point algorithm

In this paper we present an efficient way of computing principal component analysis (PCA). The algorithm finds the desired number of leading eigenvectors with a computational cost that is much less than that from the eigenvalue decomposition (EVD) based PCA method. The mean squared error generated by the proposed method is very similar to the EVD based PCA method. 2007 Elsevier B.V. All rights ...

متن کامل

Fast Iterative Kernel Principal Component Analysis

We develop gain adaptation methods that improve convergence of the Kernel Hebbian Algorithm (KHA) for iterative kernel PCA (Kim et al., 2005). KHA has a scalar gain parameter which is either held constant or decreased according to a predetermined annealing schedule, leading to slow convergence. We accelerate it by incorporating the reciprocal of the current estimated eigenvalues as part of a ga...

متن کامل

FRPCA: Fast Robust Principal Component Analysis

While the performance of Robust Principal Component Analysis (RPCA), in terms of the recovered low-rank matrices, is quite satisfactory to many applications, the time efficiency is not, especially for scalable data. We propose to solve this problem using a novel fast incremental RPCA (FRPCA) approach. The low rank matrices of the incrementally-observed data are estimated using a convex optimiza...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Signal Processing

سال: 2022

ISSN: ['1053-587X', '1941-0476']

DOI: https://doi.org/10.1109/tsp.2022.3229635